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Abstract. We consider a trader who wants to direct his portfolio towards 
a set of acceptable wealths given by a convex risk measure. We propose a 
black-box algorithm, whose inputs are the joint law of stock prices and the 
convex risk measure, and whose outputs are the numerical values of initial 
capital requirement and the functional form of a trading strategy to achieve 
acceptability. We also prove optimality of the obtained capital. 

Key words: Measures of risk, VC-dimension, portfolio optimization, Neyman- 
Pearson lemma. 

1. Introduction 

1.1. Objective. In this paper, we consider a T period market model, with a single 
stock and a money market. To model uncertainty in the stock price movements, we 
consider a probability space (O, J 7 , P) and a filtration T§ C T\ C . . . C Tt Q T. At 
every time point t = 0, 1, 2, . . . ,T, the discounted price of the stock, St, is assumed 
to be an integrable random variable measurable with respect to Tt- 

Next we consider a convex measure of risk. In the following subsection we briefly 
discuss the definition and significance of such a measure. Here it suffices to define it 
in the following way. Let {Qi}, % = 1, . . . , m, be a collection of probability measures 
on the sample space (f2, T) which are absolutely continuous with respect to P, with 
Radon-Nikodym derivatives 

(1) {fi = dQJdP}. 

We are also given a collection {a{\ of real numbers. For every random variable 
X e n i L 1 (Q i ), define 

(2) p(X)= sup [E Q *(-X) + ai ] = sup [-E(Xfi) + Oi] . 

l<.i<m l<i<m 

We call such a p to be a convex measure of risk. 

Let us now introduce an agent who follows a self-financing portfolio by holding £ t 
number of shares in between time periods t and (t+1). Due to the non-anticipative 
nature of trading, each £ t is an J^-measurable random variable. For any choice of 
initial capital w , and strategy (£o>£i; ■ ■ ■ >£t-i)> let V(wo,£) denote the discounted 
terminal value of the portfolio, i.e., 

(3) V(wo,t) = w +W(0, where 

T-l 

(4) w(0 = ]T6(s t+ i-S0- 

t-Q 
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In this paper we investigate an algorithm to compute a near-minimal wo and strat- 
egy £, such that p(V(wo,£)) < 0. We shall then say that V(wo,£) is acceptable. 

Our objective is indeed numerical computation, and not just theoretical expres- 
sions. We do not impose any restrictions on the law of the price process S. However, 
we do assume the existence of {!Ft, P)-integrable random variables a t and b t such 
that the agent is forced to obey 

(5) at<Zt<h, Vt = 0,l,...,T-l. 

This is often a natural assumptions dictated by trading constraints. In any case, 
this is crucial for our analysis. 

The literature on convex measures of risk is almost silent about computing strate- 
gies to achieve acceptability. The primary difficulty being that the terminal con- 
ditions on the portfolio are not given by almost-sure equalities/inequalities. This 
prevents the use of classical change-of-measure techniques. In this paper, we take 
an novel computational approach, combining the theory of Uniform Law of Large 
Numbers with standard Monte-Carlo simulations. 

1.2. A brief history of the literature. In recent times, the theory of measures 
of risk has generated a lot of interest in the mathematical finance literature, partly 
because it makes a rigorous assessment of risks associated with random financial 
net worths, and partly because it generalizes No- Arbitrage asset pricing and super- 
hedging ideas in incomplete markets. 

One of the first articles to define and study such measures is the seminal paper 
ADEH99 , which provides a definition and justifies a unified framework for analy- 
sis, construction and implementation of measures of risk. As the authors point out, 
these measures of risks, named coherent measures, can be used as extra capital re- 
quirements, to regulate the risk assumed by market participants, traders, insurance 
underwriters, as well as to allocate existing capital. The idea is twofold: first to 
stipulate axioms which define acceptable future random net worths, and secondly, 
to define the measure of risk of an unacceptable position as the minimum extra 
capital which, invested in a 'pre-specified reference investment instrument', makes 
the future discounted value of the position acceptable. The axioms defining accept- 
ability do not specify a unique measure of risk, instead, they characterize a large 
class of risk measures. The choice of precisely which measure to use from this class 
has to be determined from additional economic considerations. 

A significant extension was made by introducing convex measures of risk in 
FS02 . A similar set-up, as in [5DEH99 , is considered. However the authors argue 
that the positive homogeneity of the coherent risk measure is an undue requirement, 
because the risk of a position might increase in a non-linear way with the size of 
the position. They suggest to relax the conditions of positive homogeneity and of 
subadditivity and to require the weaker property of convexity. 

In both papers, the basic objects of study are random variables on the set of 
states of nature at a future date, interpreted as possible future (discounted) values 
of positions or portfolios currently held. A supervisor (e.g. regulator, exchange's 
clearing firm, or investment manager) decides on a subset of such future outcomes 
as acceptable risks. In other words, they choose a subset A of a suitable set of 
real functions, L°, on a set ft, and call it the acceptance set. A measure of risk 
associated with A is a function p A : L° — > R, defined by 

p A (X) = inf{m | m + X G A}. 
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Conversely, for any function p : L° — ► R, one can define a corresponding acceptance 

set by A p = {X 6 L° | p(X) < 0}. Such a function, p, will be called a convex 
measure of risk, if it satisfies the following axioms: 

• Translation invariance: for all X € L°, and a £ M, we have p(X + a) = p(X) — a. 

• Monotonicity: for all X and Y in L° with X > Y, we have p(AT) < p(Y). 

• Convexity: for all X and V in L°, and all A S [0, 1], we have 

(6) p(\X + (1 - A)Y) < Ap(X) + (1 - A)p(K). 

Why these axioms are natural requirements for a measure of risk has been argued 
in |ADEH99I Secti on 2.2 ] and |FS02| . and we skip such details. 

The authors of FS02 then prove a representation theorem, similar in spirit to 
one in |ADEH99] , which shows that any convex measure of risk on a finite ft is of 
the form 

(7) p(X)= sup (E p [-X] + a(P)). 

Pev 

Here, the set V is the set of all probability measures on fi. The function a(-) is 
a certain penalty function on V which takes values in lU {— oo}. (Here we stray 
from the usual convention where a in J7J is replaced by —a.) Representation (J7J 
was independently proved by David Heath in HeaOO . As before, a convex measure 
of risk defines an associated acceptance set given by 

(8) A p = {X € L° | p(X) < 0} = {X e L° | E P [X] > a{P)} . 

Broad extensions of J7J can be found in FS04 , all of which exhibit the same 
structure. 

Similar notions as above started appearing simultaneously from very different 
contexts. In a now well-known paper, |CGM0l| . the authors use the notion of 
acceptability to present a new approach for positioning, pricing, and hedging in 
incomplete markets that bridges standard arbitrage pricing and expected utility 
maximization. Also the theory of no-good-deal pricing (NGD), as a pricing tech- 
nique based on th e absen ce of attractive investment opportunities in equilibrium, 
was introduced in |CH0lj . The term no-good-deal is borrowed from an earlier paper 
with similar objectives, CSR00 , where good-deals were defined by high sharp-ratio 
of returns. The first paper which fully establishes the link between coherent risk 
measures and the NGD pricing theory is JKOT] , who shows that convex risk mea- 
sures are essentially equivalent to good-deal bounds. Relations between measures 
of risk and NGD are further extended by Staum in |Staf)4| , where he proves funda- 
mental theorem of asset pricing for good deal bounds in incomplete markets. 

All these diverse motivations can be assimilated by considering what authors of 
FS02 call measure of risk in a financial market. Several authors have recently 
contributed to the development of this theory, e.g., |BEK05a) and BEK05b', who 
establish these risk measures as special cases of inf- convolution of risk measures. 
Consider the setting in the last subsection, in particular, the notations in J2J and 
Q. The minimum wq for which inf^ p(V(wo,^)) is non-positive can be thought of 
as a price one has to pay today for achieving acceptability in future. As is shown 
in | FS02| . for any random variable Z, one can choose the penalty function suitably 
such that the minimum wq is the market measure of risk of Z. This duality between 
price and risk measures is also seen in NGD pricing. If a strategy £ exists which 
achieves the infimum above, then, it can be thought of as a hedging strategy in the 
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NGD setting. In any case, it can be thought as a strategy to achieve acceptability 
in the future, starting from a currently non-acceptable portfolio. 

Our risk measure, p in is clearly convex. We restrict ourselves to finite sets 
{Qi, • • ■ , Qm} to aid computation. This can be interesting either in its own right, 
or as an approximation to the general case. The assumption Qi <C P is implied by 
the natural requirement: p(X) = p(Y) if P(X = Y) = 1 (see, e.g., FS04|). 

1.3. Summary and organization. We propose our main result in the following 
section. First, we suppose that for a given wq, the set of strategies £ which satisfy 
(0, and for which p(V(wq,£ i )) < is non-empty. Then, Proposition 12.11 proves 
that the intersection of this set with a specific, much smaller family of strategies 
is also non-empty. This smaller set of strategies is indexed by a finite-dimensional 
space, and has nice combinatorial properties. This allows us to use the theory 
of Uniform Law of Large Numbers (ULLN), and devise a Monte-Carlo scheme to 
numerically compute a near-minimum wq and a corresponding strategy £ to have 
p(V (wq , £)) non-positive. In Section |3 we describe the method, and give precise 
error bounds on such approximations. In Section 0] we consider a natural example 
in which stock price follows discrete geometric Brownian motion, and show how 
our method leads to numerical values of both near-optimal capital and strategy to 
achieve acceptability. 

1.4. Acknowledgments. I thank Prof. Peter Bank for suggesting the particular 
example in Section^ 



2. Main results 

Recall that m refers to the number of probability measures in the representation 
of p in J2J). Let C be the collection of adapted processes £ = (£o, • ■ ■ , £r-i), which 
satisfy (JjJJ. Define the following set: 

(9) ft={(E Ol (W(0),...,E«»(W(0)), de£}. 

For any k £ N and any x £ define the upper quantant of x, denoted by Q x , as 
the set 

(10) Q x = {y G R k : Vj > Xj for j = 1, 2, . . . , k). 

The dimension is suppressed in the notation for Q Xl since it is obvious from the 
dimension of x. 

Proposition 2.1. Fix a wq G R. Let zq — (cki — wq, . . . , a m — wq). Assume that 
the convex set Q Zo n 1Z has a non-empty relative interior. 

For every 1 < i < m, define the adapted sequence of random variables 



(11) v t (fi) = (b t - a t )E (5t+i-5 t )/i 



< = 0,1,...,T-1. 



For every r € W 11 , consider the following weighted sum process 

m 

(12) A t (r)= J2 rMfi), t = 0,...,T-l. 

i=i 

Now, let r\ be any continuous probability distribution function on the real line with 
finite first moment. Then, there exists a vector r* G M. m , such that the {Tt\-adapted 
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process 

(13) £(w) = (&t-otMA t (r*))+ot, t = 0,...,T-l, 

satisfies 0) and p(W(£*)) < w . 

Remark 1. Note that assuming Q 2o Pi 7^ being non-empty is equivalent to assuming 
the existence of a strategy £ such that p(wo + W(£)) < 0. In the above proposition 
we assume a bit more than that. 

The proof of this result will follow after we have introduced some notations. Let 
[T] denote the set {0,1,..., T— 1}. Enlarge the original sample space by considering 



(14) ft X [T] = ft X {0,1,..., T- 1}. 

Let V [T] be the power set of the finite collection {0, 1, . . . , T - 1} and let T ® T [T] 
denote the product cr-algebra between T and V^ T >. Extract a sub cr-algebra T by 
defining 

(15) f= \A e T(g>V [T] {uj -. (to,t) e A} e F t , V t = 0, 1, . . . ,T- l} . 

That T is a valid cr-algebra is straightforward to verify. Finally, let Ut denote 
the discrete uniform measure on [T] , and consider the product measure P <S> Ut 

on the cr-algebra T. This gives us a probability space (ft x [T],T,P® U T ). The 

advantages of considering the above probability space is the following trivial lemma. 

Lemma 2.1. A process (/io, hi, ... , /it-i) is adapted with respect to (ft, T) if and 
only if the random variable H(uj,t) = h t {uj) is measurable with respect to the en- 
larged space (ft x [T] , T) . 

Proof. Follows from the definition of T . □ 

For all sequence {£ t } that satisfy © (i.e. £ £ £), let us make a change of variable 
4> = 7r(£), where 

(16) ^ t = T (f) t ^(f t -a t )/(& t -o t ) ) t = 0,...,T-l, 

then, each <f> t is JFt-measurable and P(0 < <f>t < 1) = 1. 

Now, the discounted terminal value of the portfolio in can be expressed in 
terms of the (f> = 7r(£) as 

T-1 



W(0 = W o 7T" 1 (0) = ( S t+i - St) \(b t - a t )<p t + a t ] 
t=o 

T-1 



t=0 
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Thus, for any suitably integrablc / defined on (SI, J 7 , P), one can write 



T-l 



fw(OfdP = E(W(Of) = ^E([(& t -o^(S t+1 - Wt + at^+x-^)]/) 

n—1 n — 1 

t=0 
T-l 

= ^E[« f (/W + C (/), 



t=o 



(18) 

where, we have named 



t=o 



(19a) 



w t (/)(w) = (6 t -a t )E (5 t+ i - 5 t ) / ^ t (w), and 



(19b) 



c(/) - E 



T-l 



t=0 



For t £ [T], if we now look at <f> and u as functions of two arguments [u>, i), i.e., 
(19c) 



(w,f) = &(w), «(/)(«;, i) =«*(/)(«>)> ^e^, 



then, by Lemma lTTl both <j> and f are J 7 - measurable functions on Ox [T]. Moreover, 
v(f) is P ® E/ T -integrable and P ® U T ({0 < (f> < 1}) = 1. Thus, from JTSJ, we can 
write 

T — 1 

(20) / iyo7r- 1 ( ( /»)/dP-c(/) = V / v t (f)MP = T [ Mf)&{P®u T ). 
J t=0 J JQx[r] 

Proof of Provosition \2. 1\ Let 77 be any continuous probability distribution function 
on the real line with finite first moment. Consider the probability space (R, B(R), rj), 
where B(R) is the Borel cr-algebra on R. Consider the following product space 



(21) 



n x [t] 



•F<g>B(R), P®U T ®V- 



Let us recall here that il x [T] , and .F are defined in (|14fl , (|15|l , and Ut is the discrete 
uniform measure on the set [T] = {0, 1, 2, . . . , T — 1}. Let Z be a measurable map 
from this product space to R, given by 



Z(ui, t, x) = x, 



en, te [T], x € 



Clearly, Z has distribution 77, independent of the cr-algebra T . 

Consider the functions /, = dQi/dP appearing in J2J), and define the following 
functions in L : (P ® Ut <8> 7?): 



(22) 



gi(u,t,x) = v(fi)(u,t), l<i<m, g m+1 = -Z, 



where the function v is defined in Ijl9a(l and H19c[l . Also define the constants 

7i = ("i -w - c(/i)) /T, i = 1, 2 . . . , m. 
The function c is defined above in 119b|l . 
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• Define $ to be the convex collection of all T ® B(K)-measurable functions <f> such 
that P ® Ut <8 ??(0 < 4> < 1) = 1. Let M denote the set of points 



M = 



<i>g m +i 



where the integrations are with respect to P^Ut®?]. Recall the assumption in the 
statement of the proposition that Q Zo PI 1Z has a non-empty relative interior. Since 
every strategy £ £ L corresponds to a by the linear mapping tt defined in (|16|1 , it 
follows that there is a point (qi, . . . , q m ) which is an interior point of M (~1 Q 1 . 
We look at the following maximization problem: find the maximizcr of 



In 



c/)g m+ id (Pig) U T <8> v) 

'fix[T]xR 

among all 4> € A C <£>, where .4 is defined by 



(23) 



.4 



<Pgid(P®U T ®r)) = Qi } ■ 

/Ox[T]xR J 

We use Theorem 5 on page 96 of |Leh86) . Part (iv) of this Theorem guarantees 
the existence of a solution (f>* of the above maximization problem which is of the 
form 

!!> 11 YhLi nv(f t ) + g m+ i > 0, 
°> if i + g m +i < 0, 

for some (ri, . . . , r m ) S K m . Recall that g TO +i = — Z. and that from the definition 
of the function v in (|19c(l . it is clear that each v (fi) is independent of Z. Thus 



(m \ r / m 

Y^Ah) = Z J= J T M z = J2 riV (M 



F d (P ® E/ T ) = 0, 



the integrand is zero being the consequence of the continuity of r\. Thus, the solution 
in (|24|) is actually 



(25) 



i if Y,Zinv(fi) > z 

otherwise. 



Now from the constraint <p* e A, we also get that / 4>*gid (P Ut <S) i]) = qi > 
7i, i — 1,2, ... ^m. In other words, J <fi*v(fi)d (P (gi /7t <E> ^) > 7i, or, by expanding 
7i, equivalently 

(26) T f E (> T) v{fi)d (P ® (7 T ) + c(/») > a, - icq, * = 
inx[r] v 7 

We have used Fubini above, where 



. . , m. 



E 



(f\£ y )(u),t) = {P®U T ®v) \ Y J r iv{fi)-Z>U\!F j(w,t) 



where the {.Ftj-adapted process {A t } is defined as in l(T2l . Thus, if we let 
£* = (6 4 -a t )?7(A t )+a t , * = 0, .. .,T- 1, 
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then, C = n- 1 (^[<f>*\P\) in the notation of (EJ. Thus, by (J2SJ) and HJ, we 
conclude that 

fw(e)fidP>ai-w , i = l,...,m, 
or in other words, p(wo + W(^*)) < 0. This proves the proposition. □ 

3. Computations 

For every s = (si, . . . ,s m ) <E R m , recall from Proposition l2.il the JF t -adapted 
process 

m 

(27) X t (s) =J2siV t (fi), t = 0,l,...,r-l, 

i=l 

and the derived process 

(28) 6(s) = {bt-at)ri(\t(s))+at, t = 0,...,T-l. 

For suitable wq, Proposition 12.11 proves the existence of an s £ M. m via which the 
process £(s) satisfies p(wo + W(£(s))) < 0, or equivalently, by translation invariance, 

What we shall do now is like a partial converse. Suppose we can compute 
p(W(£(s))) for every s € M. m . Then we can define wq by 

w :=infp(W(e(«))). 

If s* achieves the above infimum, then, clearly p(u>o + W^(C( S *)) < 0? an d wo is near 
minimal by Proposition ^. II 

The above procedure would work if we could theoretically compute p(W (£(s))) 
for every s € R m . This is often impossible. However, for any fixed s, we can 
estimate p(W(t;(s))) by Monte-Carlo simulations upto any desired level of accu- 
racy. We show in this section that it is possible to do a Monte-Carlo simulation 
to simultaneously approximate p(W (£(s))) for every s € R m with a uniform er- 
ror bound. The feasibility of our claim depends on the theory of Uniform Law of 
Large Tumbers and the related concept of Vapnik-Cervonenkis dimension which is 
a combinatorial property of the particular structure of {£t} in l|28l) . This theory is 
well-developed and we cherry-pick only the necessary results for our purpose. These 
have been stated in the appendix. Further references have also been provided for 
the interested reader. 

Once we have our estimation procedure ready, we can construct a finite mesh 
G within R m and approximate the value of p(W(£(r))), by (say) p(r), for every 
red. Note that the error in approximation in our Monte-Carlo procedure does 
not depend on the size of the grid, and we can make it as large and fine as we want. 
For that fine mesh G, let r* be a grid point which attains p(r*) = min r6( g p( r )- 

Let Wq = p(r*). Then, as we describe below, given any e, S > 0, with a very high 
probability greater than (1 — S), the choice of (wQ,£(r*)) satisfies 

pK+W)))<£. 

This gives a near-minimal initial capital for the problem of finding (wq , £) which 
satisfies (jSJ and p(w + W(£)) < e. 
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Central to computing p(W(£(s))), for any s £ <S m +i, is to compute E(W(£(s))-fi) 
for every that defines p. Now, from equation (XEJ, we can write 

T— 1 

E(W(£(a))/<) = E / «*(/*)^(s))dP + c(/ i ) 

= t[ r7(A( S ))«(/ 4 )d(P®^ T )+c(/ 4 ). 

JOx[T] 

(29) = t[ I{\(s)-Z >0}v(fi)d(P®U T ®r)) + c(fi). 

iilx[T]xl 

Here, as in the last section, Z is a random variable with law 77 independent of J 7 , 
and !{•} denotes the indicator of an event. 

We would now like to do a change of measure in (|29|l above with v(fi) as the 
'Radon-Nikodym' derivative. This is not possibly directly, since v(fi) is not nec- 
essarily positive. However, we can work separately with v + (fi) = max(u(/j),0) 
and v~(fi) = max(-«(/j),0), which denote the positive and the negative parts 
respectively. Hence, one obtains 

E(W(£(s))/0 - c(fi) = t[ l{X(s)-Z>0}v + (f i )d(P®U T ®r l ) 

Jnx[T]xR 

-T [ I{\(s)-Z>0}v-(fi)d{P®U T ®ri) 

Jnx[T]xR 

= df-(nt®r,){\(s)-Z>0} 

(30) -dr ■ (pT ® v ) {X(s) -Z>0}. 

Here we have introduced several probability measures on (Q x [T], J-), defined by 
their corresponding unnormalized Radon-Nikodym derivatives: 

(31a) dp,t/d[P®U T ) oc «+(/<), dtir/d{P®U T ) oc «-(/<), 

and the corresponding normalizing constants (multiplied by T): 

T-1 T-1 

(31b) df^^Elv+ifi)], d-^^E[V(/,)], i = l,2,...,m. 
t=o (=0 

If any of the constants in (|31bjl is zero, the corresponding measure becomes the 
zero measure and can be dropped from our analysis. For efficiency in computation 
we would like to keep track of the number of non-zero measures above by defining 

m 

( 32 ) N = E(V>o> + 1 H->o})- 

4=1 



Assumption 3.1. Throughout the rest of this section, we shall assume that 

(1) one can generate samples from the joint distribution of (So, Si, . . . , St), 

(2) the random variables vt(fi) (thus also At) can be evaluated given the values 
of (S , . . . , St), and 

(3) the constants c(/i), df and d~ can be evaluated for every 1 < i < m. 
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Now, by (pTTTft . evaluating E(W(£(s))/j) boils down to evaluating the following 
two probabilities 

(33) (n+ ®?7){A(s)- Z >0}, and (^r ® rj) {A(s) - Z > 0}, s £ M m . 

Instead, we use the Vapnik-Cervonenkis theory, described in Subsection 16 . II in the 
Appendix, to set up a Monte-Carlo scheme to estimate them for all s £ R m with 
uniform precision. The key to this is to observe the trivial equality 

J2s jV (f 3 )-Z>0 

and apply Dudley's Theorem, Theorem 16. II in the Appendix, with X = Cl X [T] 
and the vector space G to be linear space spanned by Z and v(fj), j = 1, 2, . . . 
Thus we infer that the collection of sets 



jj^Gttx [T] xl : f>^(£)(£) 



prn+1 



(;-!:,) ^ o . [J] . r : 2 . '"</, ><-'> + / ,„ + i j o^. / = 

has a VC dimension not more than (to + 1). From 134(1 . the collection of sets 

{{A(s) - Z > 0}, s e R m } 

is contained in ((35(1 . and hence also has a VC-dimension not more than (to + 1). It 
is hence possible to estimate the probabilities in ((33(1 . uniformly for all s £ M. m , by 
drawing independent samples from distributions /j,f ®r) and /i~ <£> rj. 



Our aim now would be to apply Theorem 16.31 We first have to choose two 
positive parameters, e and 8, determining the precision of our estimates. Now, for 
every i = 1, 2, . . . , to, choose nf such that 

(36) 4 K )- + .. exp (- 2< (^) 2 + 4 (^) + 4(^) 2 )<, 

Generate many iid samples {(u>j, tj, Zj) € ft x [T] x R, j — 1, 2, . . . , n^}, from 
the joint distribution (8 to 

Remark 2. It is fairly standard to generate samples from measures fxf, defined 
through their unnormalized densities given in l(31a(l . We can either directly identify 
the distribution, as we do in the next section. Or, under the assumption that one 
can generate perfect samples from the underlying distribution (P ® Vr), one can use 
any of the standard Markov Chain algorithms, from the simple rejection sampling, 
to the general Metropolis-Hastings algorithm to generate samples from [xf . Several 
books, e.g. (GCSR031 Chap. 11], describe the details of all these algorithms. 

Let '(■) denote the empirical estimates of probabilities by the sample frequency. 
For example, for any s £ R m , we have 

(37) £+{X s -Z>0} = -l^N{A ti ( S )(^-) - zj > 0}. 
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We can now apply Ij53(l from Theorem l6.3l to claim that under the joint distribution 
of all the nf many samples drawn 

(38) Prob ( sup d+\ £+{X s - Z > 0} - (p+ ® tj){X s - Z > 0} | > e 1 < 5, Mi. 
Exactly in the same way one can replace the pf by p~ above, compute k~ by 

(39) 4( *«W (£) + 4 (i)") < { , 
and obtain estimates £~, analogous to l|37l) . which satisfies 

(40) Prob \ sup | £r{\ s - Z > 0} - (/xr g r?){A s - Z > 0} | > e I < <5, Vi. 

From (|38[) and (|40|) . it follows, by using (|30|l . that one can estimate the quantity 
E(-W(e(*))/<)+aity 

(41) Vi(s) = -df£f{X(s) - Z > 0} + dr£r{A(«) - Z > 0} - c(/«) + a, 

Since p(VF(^(s))) = sup 1<i<m {E(— W (£(s)) fi) + on} , it follows that a good estimate 
ofp(W($(*))) is 

p(s) = sup T>i(s). 

i 

We can sum-up this approximation by a simple union bound using (|38|) and 14U|) 
as follows. 

Under the joint distribution of all the {nf , K^}i<i< m samples drawn from the 
distributions {pf (£> 77, /i~ (8 <7}i<i<m, one has 

Prob j sup \p(s) - p(T^(C(s))) | > el > 

Here, the number H (< 2m) is described in i|32|) . We use the number H and not the 
crude bound 2m to bring more efficiency in our estimate. 

Now that we have estimated p(W(£(s))) for every s € M. m with uniform precision, 
we can carry out the grid searching procedure described at the beginning of this 
section. We minimize p(s) over the grid nodes (say G) to obtain 

w o = inf p( s ) = p( s *)- 

Then, with a probability more than (1 — K<5), we have 

p(W(Z( S *))<p(s*)+e<w* +e. 

In other words, with a high probability of being correct, we get 

p(w* +W(Z(s*))<e. 

Thus one obtains a near-optimal pair (wq,£) which satisfies © and p(wo + W(£)) 
is almost non-positive. The next section displays the entire method through an 
explicit example. 
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4. Examples 

The previous theory is now applied to an explicit example where stock prices 
follow geometric Brownian motion, but observed only at finitely many time points. 

We consider T — 3 and f2 = R T , the cr-algebra Tt being generated by the first 
t co-ordinates of oj £ ft. We take To to be the trivial cr-algebra {0, ft}. Take P to 
be the product probability measure of T many independent Normal distributions 
with mean zero and variance one. In other words, we consider random variables 
(Zx, Z2, • ■ ■ , Zt) such that each Zi is independent and identically distributed as 
iV(0, 1). The discounted stock price movement, under P, is described by 

Zt+i 



(42) S = 4, St+i = S t exp 

In other words, we have 



2 



t = 0,l,...,T- 1. 



(43) S t = So exp 



t 



K 2 

i=l 



t = l,2,...,T. 



However, the investor is not entirely certain of his modeling assumptions, and so 
considers other scenarios Q\ and Q2, where Q\ and Q2 are two probability measures 
defined on (ft, Tt) by 

under Qi, Z u . . . , Z T ~ N(l, 1), 

under Q 2 , Z u . . . , Z T ~ N(-l, 1). 
For convenience we also introduce Q3 = P. 

Remark 3. Note, from the effect of changing measure on the stock price 

movements. For Qx, the geometric Brownian motion gets a positive drift, for Q2 it 
gets a negative drift, while Q3 is the same as P, where stock prices are a martingale. 

Assume that various constraints dictate that his trading strategy is bounded 
between zero and one throughout, i.e., in the notation of (JSj, we have 

a t = 0, b t = 1, for all < t < T - 1. 

Now, the investor sets to do the following: if the conditions are favorable, and 
the stock prices tend to go up under Qi, he wants a large lower bound e 4 for his 
expected terminal wealth. On the other hand, if the stock prices tend to go down, 
under Q2, he sets a lower bound for his expected losses, by setting that his final 
expected wealth should be more than e _1 . He has at least $0.2 to invest, and would 
like to know an optimal initial capital, and a trading strategy to achieve his goals. 

This requires us to define a measure of risk p: if X is measurable with respect 
to Tt, then 

p(X) = max \E Q ' (-X) + aA, m = 3, 

i=l,2,3 

where 

a% = e 4 , a2 = e _1 , = 0.2. 
Then, we would like to compute a near-optimal pair (wq , £) of initial capital wq and 
< Ct < 1, for all < t < T - 1, such that 

p(w + W(0) < w +EQ*[W(Z)} >Oi, i= 1,2,3. 
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The first step will be to compute the functions /i , /2, and fa . They are straight- 
forward since 



(44) 



/i(zi, ...,z k ) = dQi/dP = exp 

fs(zi, ...,Zk) = dQ 2 /dP = exp 
h{zi,...,z k ) =dQ 3 /dP= 1. 



lk=l 



£>-T/2 

T 

J2 z k~ T/2 



fe=i 



We can now compute the functions Vt(fi). These are given by 



(45) 
vt(h) 



= E[f 1 (S t+1 -S t ) \F t ] 

= S t E [ft (exp(Zt+i - 1/2) - 1) | T t \ , from @2), 

/ t \ / ( T ~\ 



S t exp [ Z k ) E ( exp <j £ Z fc - T/2 j [exp(Z t+1 - 1/2) - 1] J , 



U-i 



where the last equality is due to l|44|) and the independence of {Zi}. Recall 
that if Z follows N(0, 1), then E [exp(crZ)] = exp(cr 2 /2), a G M. Thus, for 
z = (zi, Z2, . . . , zt) € f2, a straightforward computation leads to 



(46) 



zZ Zk 

lk=l J 



<*1>( l-tUexpf-l 



v t {j i)(z) = S^exp 

= 4(e-l)exp<j2^z fe -i|>, by ® 



In particular, we have E(ii t (/i)) — 4(e — 1)E 
Similarly, we compute 



exp 



(2£Li 



6.87e*. 



vtih) = E [/ 2 (5 i+1 - S t ) | ^ t ] 

= S t E [/ 2 (exp(Z t+1 - 1/2) - 1) | ^] , from g2), 

= S t exp ( - ^ Z fc ) E ( exp j- ^ Z fc - T/2 1 [exp(Z m - 1/2) - 1] 



fe=i 



fe=t+i 



e- 1 



(47) = -S cxp(-t) = -4(e - 1) exp(-t - 1) 



And obviously, since St is a martingale under Q3, we have 



v t (f 3 )=E[S t+1 -S t \Ft)=0. 
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Hence, for s = (si, s 2 , S3) € R 3 , the random variable Xt{s) is given by 



A t (s) =4e-*(e- 1) 
= 4e"*(e- 1) 
= 4(e-l) ,s. 



si exp |2y^ z fc 

2 



s 2 exp(-l) 



sie 

St 
So 



) - s 2 exp(-l) 



s 2 e 



-t-i 



Thus, for 1 < t < 2 and z = (zi, z 2 , 2:3) 6 f2, we have the following table: 



v+(f 1 )(t,z)=v t (f 1 )(z), 

v+(f 2 )(t,z)=0, 

u+(/ 3 )(*,z)=0, 



4 



4 = o, 



76.34, 
0, 



«-(/i)(*,a)=0, 
w-(/ 2 )(t,z)=2.53e- t > 
« + (/ 3 )(*,a) = 0, 



0. 

3.80, 



4 = 0. 



From above and Q32[l. we also have H = 2. Clearly, we need to consider only two 
changes of measures, the one given by w + (/i) and the other by t> _ (/ 2 ). The rest are 
all zero measures. Finally, since a* = 0, from (|19bp . we get c(/i) =0, i = 1,2, 3. 
We take the precision parameters to be 



.5, 



.05. 



From (|36|l and l|39|l . we determine a sufficient number of samples for desired accu- 
racy would be 

k+ = 1,400,000, K 2 = 10,500. 

Let us now analyze the probability measures [if and /x~ on M 3 x {0, 1, 2}. If z € R 3 , 
and < t < 2, then from (|31aJ) and |@SJ| we get 



(48) 



df4(x,t) cx u+(/i)(z,t)-d(P(g)C/ T )(z,t) 



oc exp < 2 Zk — t 



tx e 



^y-pf-ip^-i'-st'i}- 

7 1 fe=i fc=t+i j 



Thus generating a sample from /if is the same as picking a ( e (0, 1,2) randomly 
with probability proportional to exp(i). Then, conditionally on t, we generate t 
independent samples Z\,...,Zt from N(2, 1), and 3 — t samples from N(0, 1). 
Simulating from ^ is even simpler, since, from (|47|l . we get that 



(49) 



d/z"(z,i) cx v-(/ 2 )(a,t)-d(P®tf T )(M) 

3 



oc e 



1 



/2vr 



exp 



fe=i J 



Here, we pick £ from {0, 1, 2} with probability proportional to exp(— t), and generate 
(Z\, . . . , Zt) as independent and identically distributed samples from N(0, 1). 
Finally, we take i] to be N(0, 1). 
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result OF simulations. We first generate the required number of samples from 
Hi and H2 and set them aside. Now we choose a variety of grids, making them 
finer and more localized as we proceed, until p converges to a global minimum. 

An estimate of the minimum capital is Wq = 0.41. The optimal grid point comes 
to Si = 0.05, S2 = 9.65. Thus, an estimate of the trading strategy for this capital is 
£* = $ (At), where $ is the standard normal cumulative distribution function, and 
At is the process given by 



At = 4(e - 1) 



.05 



9.65e" 



In other words, with a probability more than (1 — K<5) 
P« + W(r))<e = 0.5. 



.9, we will indeed have 



5. Conclusion 

We devise a Monte-Carlo algorithm to compute near-minimal initial capital re- 
quirement and a suitable trading strategy to achieve acceptability at a future date. 
The benefit of this approach is that it gives precise numerical values for portfolio 
optimization problems where purely theoretical methods (e.g. backward induction, 
linear programming) fail. 

The primary shortcoming is that this approach requires intensive computing, 
mainly due to bound J53J). However, the fault lies in the crudeness of the exact 
theoretical bound, and not in the method itself. In fact, there are better bounds 
(e.g. due to Talagrand |Tal94j ) which, unfortunately, lack exact constants. 

A related problem (brought to the authors attention by Prof. R. Jarrow at the 
CCCP conference, 2006) is the following. Suppose we have two risk measures p\ and 
P2- Can we find a pair (wqjC*) of capital requirement and trading strategy, such 
that (wqjC*) minimizes p\ among all pairs (?«,£) for which P2(w,£,) is non-positive 
? The author believes that the method in this paper can be suitably extended, and 
is currently involved in such a project. 



6. Appendix 

6.1. Uniform law of large numbers. We briefly mention here three basic the- 
orems about the theory of uniform law of large numbers and the related concept 
of Vapnik-Cervonenkis dimensions. This is a subject in itself and we shall use very 
little of it for our purpose. Hence we shall skip all details and refer the reader to 
the excellent book [DCL96I Chap. 12], from where our propositions in this section 
have been lifted. 

Notation 6.1. We consider a probability space (0,9, g), where 9 is a complete, 
separable metric space. On 9™, let g n denote the product probability measure 
on the product cr-algebra. Similarly on 9°° :— N , let g°° denote the infinite 
product probability. For any 9 £ 9°°, and any n G N, define the random empirical 
measure: g n (C) :— l/ n X)iLi l(0*eC)) CsS, or, for any 3-integrable function /, 
the corresponding random expectation g n {f) ■— l/ n S™=i f(@i)- 

For any C € 3 and any e > 0, the law of large numbers dictate 
(50a) lim g°° (\g n {C) - g{C)\ > e) = 0. 
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However, if we have a collection of {C a }aei of sets in 3, it is not always true that 
(50b) lim B °°( sup \g n (C a ) - g{C a )\ > e) = 0. 

Equality above can be achieved under proper conditions on the collection {C a } a< =i, 
and then we say Uniform Law of Large Numbers (ULLN) holds. The Vapnik- 
Cervonenkis theory provides one such condition. Its strength lies in that the con- 
dition on {C a } a £i is combinatorial in nature, and hence independent from the 
choice of g. (This sometimes can also be a weakness, since significant improve- 
ments can be made for specific choice of g.) The theory begins with the concept of 
shatter- coefficient. 

Definition 6.1. Let {C a } a ^i be a collection of 3-measurable subsets of O. For 
(9i, ... , 6d) € Q d , let Af(0i, ■ ■ ■ , 9d) be the number of different sets in 

{ {e 1 ,...,e d }nc a , aei }. 

The d-th shatter coefficient of the collection {C a } a ^i is defined as 

s d = max J\[(0x,...,6 d ). 
(0i,...,e d )ee d 

In other words, the shatter coefficient is the maximal number of different subsets 
of d points that can be picked out by the class {C a } a £i. 

Remark 4. Note that we have deliberately suppressed mentioning the class {C a } ae i 
in the notation for the shatter coefficient. This is really for notational clarity. The 
shatter coefficient is clearly a property of the collection of sets we consider. 

The following theorem can be found in |DOL96l Thm 12.5, p. 197]. 

Theorem 6.1. For any collection {C Q } Qe / ; and for any n € N, e > 0, we have 

(51) |sup|e„(C Q ) - g{C a ) | >e| < 8s n exp(~ne 2 /32), 

where the constant s n is the nth shatter coefficient of the collection {C a } a ^i and 
is independent of the probability measure g. 

Hence l|50bll will hold if the constants s n grows at most polynomially. This is 
achieved for certain collections of sets which have a finite Vapnik-Ceronenkis (VC) 
dimension. The following definition is from DGL96 p. 196]. 

Definition 6.2. As before we consider the collection {C a } a ei of 3-measurable 
subsets of 6. The largest positive integer for which Sd = 2 d is known as the VC 
dimension of the collection {C a } ae i. If s d = 2 d for all integers d > 1, we then 
define the VC dimension to be oo. 

The next lemma |DGL96I p. 218] describes a fundamental relationship between 
VC dimension and the shatter coefficients. 

Sauer's Lemma. Let {C a } a ei be a subset o/3 with finite VC dimension V > 2. 
Then for all n > 2V, we have s n < n v . 

Thus Theorem 16. II together with Sauer's Leamma will yield the following. 
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Theorem 6.2. Let (6,3) be a measurable space. Let {C a } a ei be any collection 
of measurable subsets of Q with a finite VC dimension V. Then for any probability 
measure g on (0, 9) and any n > 2V, we have 

(52) jsup|g„(C Q ) - g(C a ) | >e| < 8n v exp(-ne 2 /32). 

In particular, ]im n _ too £>°° {sup aeI \g n (C a ) - g{C a ) | > e} = 0. 

The following better bound is from Devroye (1982). 
Theorem 6.3. In the setting of the previous theorem \6 . 6 A we have 

g°° |sup|e„(C Q ) - g(C a ) \ > ej < 4s„2 exp{-2ne 2 + 4e + 4e 2 ). 

Hence, by Sauer's Lemma, 

(53) g°° |sup|e„(C Q ) - f?(C Q ) | > ej < 4n 2Vc exp(-27ie 2 + 4e + 4e 2 ). 

Finally, we shall need the following collection of sets with finite VC dimension. 

Proposition 6.1. jDud78l Thm 7.2] Let G be a d-dimensional real vector space of 
real functions on an infinite set X. Define the class of sets 

C = {{x€X:g(x)>0}:geG}. 

Then the VC dimension of C is not more than d. 

References 

[ADEH99] P. Artzner, F. Delbaen, J.M. Eber, and D. Heath, Coherent measures of risk., Math. 
Finance 9 (1999), 203-228. 

[BEK05a] P. Barrieu and N. El Karoui, Inf- convolution of risk measures and optimal risk trans- 
fer., Finance & Stochastics 9 (2005), 269-298. 

[BEK05b] , Pricing, hedging and optimally designing derivatives via minimization of risk 

measures., To appear in Volume on Indifference Pricing, Princeton University Press, 
2005. 

[CGM01] P. Carr, H. Geman, and D. Madan, Pricing and hedging in incomplete markets., J. 
Financial Economics 62 (2001), 131-167. 

[CHOI] A. Cerny and S. Hodges, The theory of good-deal pricing in incomplete markets., Math- 
ematical Finance - Bachelier Congress 2000, Springer- Verlag, Berlin, 2001, pp. 175- 
202. 

[CSR00] J. H. Cochrane and J. Saa-Requejo, Beyond arbitrage: Good-deal asset price bounds 
in incomplete markets, Journal of political economy 108 (2000), no. 1, 79-119. 

[DGL96] L. Devroye, L. Gyorfi, and G. Lugosi, A probabilistic theory of pattern recognition., 
Applications of Mathematics, vol. 31, Springer- Verlag, New York, 1996. 

[Dud78] R. M. Dudley, Central limit theorems for empirical measures., The Annals of Proba- 
bility 6 (1978), no. 6, 899-929. 

[FS02] H. Follmer and A. Schied, Convex measures of risk and trading constraints., Finance 
& Stochastics 6 (2002), 429-447. 

[FS04] , Stochastic finance: An introduction in discrete time, second ed., Studies in 

Mathematics, vol. 27, de Gruyter, Berlin., 2004. 

[GCSR03] Andrew Gelman, John B. Carlin, Hal S. Stern, and Donald B. Rubin, Bayesian data 
analysis, second ed., Chapman & Hall/CRC, London, 2003. 

[HeaOO] David Heath, Back to the future, Plenary lecture at the First World Congress of the 
Bachelier Society (Paris), 2000. 

[JK01] S. Jaschke and U. Kiichler, Coherent risk measures and good-deal bounds., Finance & 
Stochastics 5 (2001), 181-200. 



18 



SOUMIK PAL 



[Lch86] L. Lchmann, E., Testing statistical hypotheses, second ed., Wiley Series in Probability 

and Mathematical Statistics, 1986. 
[Sta04] J. Staum, Fundamental theorems of asset pricing for good deal bounds., Mathematical 

Finance 14 (2004), no. 2, 141-161. 
[Tal94] M. Talagrand, Sharper bounds for gaussian and empirical processes., The Annals of 

Probability 22 (1994), no. 1, 28-76. 

Department of Statistics, Columbia University, 1255 Amsterdam Avenue, New York, 
NY 10027 

Current address: Departments of Mathematics and ORIE, 506 Malott Hall Cornell University, 
Ithaca NY 14853 

E-mail address: soumlkOmath.cornell.edu 



